PESA: Prioritized experience replay for parallel hybrid evolutionary and swarm algorithms - Application to nuclear fuel
نویسندگان
چکیده
We propose a new approach called PESA (Prioritized replay Evolutionary and Swarm Algorithms) combining prioritized of reinforcement learning with hybrid evolutionary algorithms. hybridizes different swarm algorithms such as particle optimization, evolution strategies, simulated annealing, differential evolution, modular to account for other three by storing their solutions in shared memory, then applying redistribute data between the integral frequent form based on fitness priority values, which significantly enhances sample diversity algorithm exploration. Additionally, greedy is used implicitly improve exploitation close end evolution. features balancing exploration during search parallel computing result an agnostic excellent performance over wide range experiments problems presented this work. also shows very good scalability number processors solving expensive problem optimizing nuclear fuel power plants. PESA's competitive modularity all allow it join family algorithm; unleashing optimization.
منابع مشابه
Distributed Prioritized Experience Replay
We propose a distributed architecture for deep reinforcement learning at scale, that enables agents to learn effectively from orders of magnitude more data than previously possible. The algorithm decouples acting from learning: the actors interact with their own instances of the environment by selecting actions according to a shared neural network, and accumulate the resulting experience in a s...
متن کاملPrioritized Experience Replay
Experience replay lets online reinforcement learning agents remember and reuse experiences from the past. In prior work, experience transitions were uniformly sampled from a replay memory. However, this approach simply replays transitions at the same frequency that they were originally experienced, regardless of their significance. In this paper we develop a framework for prioritizing experienc...
متن کاملReward Backpropagation Prioritized Experience Replay
Sample efficiency is an important topic in reinforcement learning. With limited data and experience, how can we converge to a good policy more quickly? In this paper, we propose a new experience replay method called Reward Backpropagation, which gives higher minibatch sampling priority to those (s, a, r, s′) with r 6= 0 and then propagate the priority backward to its previous transition once it...
متن کاملEfficient Data Mining with Evolutionary Algorithms for Cloud Computing Application
With the rapid development of the internet, the amount of information and data which are produced, are extremely massive. Hence, client will be confused with huge amount of data, and it is difficult to understand which ones are useful. Data mining can overcome this problem. While data mining is using on cloud computing, it is reducing time of processing, energy usage and costs. As the speed of ...
متن کاملHybrid artificial immune system and simulated annealing algorithms for solving hybrid JIT flow shop with parallel batches and machine eligibility
This research deals with a hybrid flow shop scheduling problem with parallel batching, machine eligibility, unrelated parallel machine, and different release dates to minimize the sum of the total weighted earliness and tardiness (ET) penalties. In parallel batching situation, it is supposed that number of machine in some stages are able to perform a certain number of jobs simultaneously. First...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Nuclear Engineering and Technology
سال: 2022
ISSN: ['1738-5733', '2234-358X']
DOI: https://doi.org/10.1016/j.net.2022.05.001